Automatic Creation of a Conceptual Base for Portuguese using Clustering Techniques
نویسندگان
چکیده
▶ Following [2]... 1. Split the original network into sub-networks 2. Calculate the frequency-weighted adjacency matrix F of each sub-network; 3. Fij = Fij + Fij ∗ , −0.5 < < 0.5; 4. Run MCL [3], with = 1.6, over F for 30 times; 5. Use the (hard) clustering from each run to create P, a matrix with the probabilities of each pair of words in F belonging to the same cluster; 6. Remove: (a) big clusters, B, if there is a group of clusters C = C1,C2, ...Cn such that B = C1 ∪ C2 ∪ ... ∪ Cn; (b) clusters completely included in other clusters.
منابع مشابه
Automatic Discovery of Fuzzy Synsets from Dictionary Definitions
In order to deal with ambiguity in natural language, it is common to organise words, according to their senses, in synsets, which are groups of synonymous words that can be seen as concepts. The manual creation of a broad-coverage synset base is a timeconsuming task, so we take advantage of dictionary definitions for extracting synonymy pairs and clustering for identifying synsets. Since word s...
متن کاملAutomatic Detection and Localization of Surface Cracks in Continuously Cast Hot Steel Slabs Using Digital Image Analysis Techniques
Quality inspection is an indispensable part of modern industrial manufacturing. Steel as a major industry requires constant surveillance and supervision through its various stages of production. Continuous casting is a critical step in the steel manufacturing process in which molten steel is solidified into a semi-finished product called slab. Once the slab is released from the casting unit, th...
متن کاملAutomatic Segmentation of the Gross Tumor Volume in Prostate Carcinoma Using Fuzzy Clustering in Gallium-68 PSMA PET/CT Scan
Introduction: Modern radiotherapy (RT) techniques allow a highly precise deposition of the radiation dose in tumor. So, high conformal tumor doses can be reached while sparing critical organs at risk. Materials and Methods: This study was conducted in three phases. In the first phase; Fourteen patients with primary or recurrent prostate cancer receive Gallium-...
متن کاملOnto.PT: Automatic Construction of a Lexical Ontology for Portuguese
This ongoing research presents an alternative to the manual creation of lexical resources and proposes an approach towards the automatic construction of a lexical ontology for Portuguese. Textual sources are exploited in order to obtain a lexical network based on terms and, after clustering and mapping, a wordnet-like lexical ontology is created. At the end of the paper, current results are shown.
متن کاملImproved Automatic Clustering Using a Multi-Objective Evolutionary Algorithm With New Validity measure and application to Credit Scoring
In data mining, clustering is one of the important issues for separation and classification with groups like unsupervised data. In this paper, an attempt has been made to improve and optimize the application of clustering heuristic methods such as Genetic, PSO algorithm, Artificial bee colony algorithm, Harmony Search algorithm and Differential Evolution on the unlabeled data of an Iranian bank...
متن کامل